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SUMM . . . the unknown DNA sequence area and thereby copying the template 
and synthesizing a complementary strand using DNA polymerases, such as 
Klenow fragment of E. coli DNA 
polymerase I, a DNA polymerase from Thermus 
aquaticus, Taq DNA polymerase, or a modified T7 
DNA polymerase, Sequenase (e.g., Tabor et al . , (1987) 
Proc. Natl. Acad. Sci. U.S.A. 84:4767-4771), in the presence of 
chain- terminating reagents. Here, the. . . sequencing DNA require the 
use of polyacrylamide gel electrophoresis (i.e., PAGE) that can result 
in sequencing artifacts or require detectable labels, such as 
radioisotopes, enzymes, or fluorescent or chemiluminescent moieties. 

SUMM ... a further object herein to provide methods of sequencing 
nucleic acids in an array format using RNA polymerase in which 
nucleic acid probes are immobilized to supports at 

high densities to facilitate mass spectrometric detection. It is also an 
object herein to. 
SUMM In certain embodiments, a double stranded nucleic acid 

molecule encoding a promoter sequence is isolated from a natural source 
(e.g., bacteria, viruses, bacteriophages, plants or eukaryotic 
organisms) or. . . of the coding strand. This single stranded region 
is designed such that it is complementary to a region^of the 
nucleic acid to be sequenced or to a common 

overlapping sequence (e.g., a restriction endonuclease site). In 
preferred embodiments, the promoter-containing nucleic 
acid is covalently coupled via the 3 ■ -end of the noncoding 
strand or 5' -end of the coding strand to a solid support. 
SUMM The nucleic acid to be sequenced containing at least 

a partially single stranded 3 ' -end is hybridized to the complementary 



sequences of the promoter-containing DNA. The nucleic 

acid to be sequenced may be single stranded or double stranded. 

The hybridization of the two nucleic acid molecules 

introduces one or more "nicks" in the hybrid at the junction (s) of the 
adjacent nucleic acid molecules. In certain 

embodiments, nicks in the coding or non-coding strand, preferably the 
coding strand, are ligated by the addition of an appropriate 
nucleic acid ligase prior to initiating transcription 
(i.e., DNA or RNA ligase). 
SUMM ... is analyzed by matrix-assisted laser desorption/ionization mass 
spectrometry (MALDI) and preferably further uses time-of- flight (TOF) 
analysis. The sequence of the nucleic acid is 

obtained by aligning the observed mass of the chain-terminated RNA 
transcripts obtained from sequencing reactions containing each of the. 

SUMM . . . method of sequencing may be used for diagnostic applications to 
determine the presence of genetic alterations in a known target 
nucleic acid. For example, a region of the target 
nucleic acid is amplified and the nucleic 

acid strand corresponding to the noncoding strand is isolated. 

The nucleic acid probe containing the promoter may 

be isolated from a natural source or assembled synthetically by 

hybridizing two complementary oligonucleotides to. . . a promoter 

sequence. A single stranded region of at least 5 nucleotides that is 

complementary to a region of the nucleic, acid to be 

sequenced or to a common sequence is introduced by recombinant means at 
the 3 1 -end of the coding strand. In preferred embodiments, the 
promoter-containing nucleic acid is covalently 

coupled via the 3 ' -end of the noncoding strand or 5 1 -end of the coding 
strand to a solid support. 
SUMM A single stranded 3 ' overhang of the nucleic acid to 

be sequenced, in single stranded or double stranded form, is hybridized 
to the complementary sequences of the noncoding strand and, in some 
embodiments, the nick(s) between one or more nucleic 
acid strands is/are ligated prior to transcription. 

Transcription is initiated using the appropriate RNA polymerase in the 

presence of ribonucleoside triphosphates. 
SUMM When used in array formats, a panel of promoter-containing 

nucleic acid probes may be constructed such that the 

single stranded complementary regions of the target nucleic 

acid may be permuted along the entire sequence, e.g., the coding 

sequence of a gene, allowing for the determination of the 

nucleic acid sequence of the entire gene during a 

single reaction sequence. 
SUMM . . . may be identified using mass spectrometric methods. In 

practicing the methods, a single stranded region of the 3 ' -end of * the 

nucleic acid to be sequenced is hybridized to a 

complementary sequence at the 3 ' -end of the coding strand a 

promoter-containing nucleic acid probe. In preferred 

embodiments, the promoter-containing nucleic acid is 

covalently coupled via the 5' -end of the noncoding strand or 3 ■ -end of 

the coding strand to a solid support. 
SUMM . . . B'-iodo CTP. In certain embodiments, nicks in one or more 

strand may be ligated by the addition of an appropriate nucleic 

acid ligase prior to initiating transcription (i.e.,. adding a 

DNA or RNA ligase) . The mass of the terminated RNA transcripts is. 
DRWD . . . silicon dioxide was reacted with 3-minopropyltriethoxysilane to 

produce a uniform layer of primary amino groups on the surface. A 

heterobif unctional crosslinking agent was then 

reacted with the primary amine to incorporate an iodoacetamide -group . An 
oligodeoxynucleotide containing a 3 ' - or 5' -disulfide (shown as. 
DETD As used herein, the term "nucleic acid" refers to 

oligonucleotides or polynucleotides such as deoxyribonucleic acid (DNA) 
and ribonucleic acid (RNA) as well as analogs of either RNA or DNA, for 
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example, made from nucleotide analogs, any of which are in single or 
double -stranded form. Nucleic acid molecules can be 

synthetic or can be isolated from a particular biological sample using 

any number of procedures which are. 

As used herein, a nucleic acid promoter-containing 

probe refers to a nucleic acid fragment that 

includes a double-stranded region encoding a promoter and a 

single-stranded region that contains at least 5 nucleotides at. 

the coding strand relative to the promoter that is complementary to a 

single stranded region at the 3 ' -end of a nucleic acid 

to be sequenced. 

As used herein, the target nucleic acid is the 
nucleic acid that is sequenced. The target 
nucleic acid will contain or will be modified to 

contain at least about 5 nucleotides whose sequence is known for 
hybridization to the immobilized nucleic acid 
promoter-containing probe. 

As used herein, nucleic acid synthesis refers to any 

process by which oligonucleotides or polynucleotides are generated, 

including, but not limited to processes involving chemical. 

The term "cross-linking agent" is art-recognized, and, as used herein, 

refers to reagents which can immobilize a nucleic acid 

to an insoluble-support, preferably through covalent bonds. Thus, 

appropriate "cross-linking agents" for use herein includes a variety of 

agents that. . . with a functional group present on a surface of the 

insoluble support and with a functional group present in the 

nucleic acid molecule. Reagents capable of such 

reactivity include homo- and hetero-bifunctional reagents, many of which 
are known in the art. Heterobif unctional . 

As used herein, the terms "protein", "polypeptide" and "peptide" are 
used interchangeably when referring to a translated nucleic 
acid (e.g. a gene product) . 

transcription would have occurred if a wild-type or native 
promoter had been used by the RNA polymerase to transcribe the 
nucleic acid in vitro. 

As used herein, a "promoter-containing nucleic acid" 
is a nucleic acid that contains a sequence of 

nucleotides that directs the site-specific binding of an RNA polymerase 
molecule to form an open. 

As used herein, a "coding strand" refers to the nucleic 
acid strand of a promoter-containing nucleic 

acid that has the same polarity as a corresponding mRNA molecule 
initiated from that promoter. 

a material used in mass spectrometry that is a proton donating, 
UV absorbing material, usually organic acid, that forms crystalline 
matrix-nucleic acid structures that are readily 

ionizable during MALDI . An exemplary matrix material is a solution of 
3-hydroxypicolinic acid (3-HPA, 0.7 M. 

Mass spectrometric methods of sequencing nucleic acids are provided. The 
sequencing methods use immobilized nucleic acid 

promoter-containing probes that contain a double stranded region 
encoding a promoter and a single stranded region for hybridizing target 
nucleic acids. The nucleic acid sequence is 

determined by generating a set of nested base-specific chain terminated 
RNA transcripts that are analyzed using mass spectrometry. 

. the art: QP replicase (e.g., see U.S. Pat. No. 5,696,249, 
Re: 35,443 and Eoyang et al . (1971) in Procedures in Nucleic 
Acid Research, Cantoni and Davies, eds . , Volume 2, pp. 829-839, 
Harper and Rowe, N.Y.); bacteria (e.g., E. coli, see Burgess and. 
The selection of the appropriate RNA polymerase for any given 
nucleic acid template to be sequenced is within the 
skill of the skilled artisan and varies according to the nucleic 
acid molecule to be sequenced. The selection may be determined 
empirically following the teachings known to those of skill in the. 



DETD Each nucleic acid promoter-containing probe used in 

the sequencing methods described herein contains a promoter. The 

promoters used in the methods herein may be obtained from any source, 

i.e., recombinant or naturally-occurring promoter elements, or may be 

assembled from synthetic nucleic acid 

oligonucleotide sequences. For example, the nucleic 

acid containing a promoter may be obtained directly from a 

variety of different organisms, such as bacteria, viruses and eukaryotic 

organisms, . . . bla or lac promoters, RSV-LTR promoter and F9-1 

promoter; Stratagene) . The selection of the appropriate promoter will 

depend on the nucleic acid to be sequenced, 

sequencing conditions, and most importantly, on the RNA polymerase 
selected for transcription. 
DETD Immobilization of Nucleic Acid Promoter-containing 
Probes 

DETD In preferred embodiments, the nucleic acid 

promoter-containing probe is immobilized, directly or by means of a 
cross-linking agent, to a solid support provided herein. Preferred 
solid. 

DETD In embodiments of the methods in which a cross -linking reagent is not 
employed, a modified nucleic acid is reacted 

directly with a appropriately f unctionalized surface to yield 
immobilized nucleic acid. Thus, for example, an 
iodoacetyl-modif ied surface (or other thiol-reactive surface 
functionality) can react with a thiol-modif ied nucleic 
acid to provide immobilized nucleic acids. 
DETD . . . acids immobilized on the insoluble support. The cross- linking 
agent (and other reagents used to functionalize the support surface or 
the nucleic acid molecule) can be selected to 
provide any desired spacing of the immobilized nucleic 
acid molecules from the support surface, and to provide any 
desired spacing of the immobilized nucleic acids from each other. Thus, 
steric encumbrance of the nucleic acid molecules can 
be reduced or eliminated by choice of an appropriate cross-linking 
agent. In certain embodiments, the cross -linking reagent can. 
multiple nucleic acids to a single cross-linking moiety. Preferably, the 
cross-linking agent is selected to be highly reactive with the 
nucleic acid molecule, to provide rapid, complete, 

and/or selective reaction. In preferred embodiments, the reaction volume 
of the reagents (e.g., the thiol. 
DETD Modified Nucleic Acid Promoter-containing Probes and 
Linkers 

DETD Preferred nucleic acid promoter-containing probes 

for use herein are "thiol-modif ied nucleic acids," i.e., nucleic acids 

derivatized to contain at least one reactive thiol. . . further 

detail in Example 1, below, nucleic acids containing at least one 

reactive thiol are preferably made by treating a nucleic 

acid containing a 3' or 5 disulfide with a reducing agent, 

which preferably will not compete in subsequent reactions (i.e. will. 

react with an iodoacetimido functionality) . Disulf ide-derivatized 
nucleic acids can be synthesized according to a variety of methods. For 
example, a nucleic acid can be modified at the 3'- 

or 5' -terminus by reaction with a disulf ide-containing modifying a 
reagent. Alternatively, a thiolated primer can by enzymatically or 
non-enzymatically attached to the nucleic acid. A 

5 1 -phosphoramidate functionality can also provide an attachment point 
for a thiol or disulf ide-containing cytosine or deoxycytosine . Examples 
of reducing agents appropriate for reduction of a disulf ide-modif ied 
nucleic acid include: tris- (2-carboxyethyl) phosphine 
(TCEP) (preferably a concentration in the range of 1-100 mM (most 
preferably about 10 mM) ) is reacted at. 
DETD In other embodiments, the nucleic acid is 

immobilized using the photocleavable linker moiety that is cleaved 



during mass spectrometry. Exemplary photolabile cross-linker include, 
but are not . 

DETD A nucleic acid promoter-containing probe can be 

directly linked to a solid support via a reversible or irreversible bond 
between an appropriate functionality (L 1 ) on the target nucleic 
acid molecule (T) and an appropriate functionality (L) on the 
capture molecule. A reversible linkage can be such that it is. 

DETD . . . ammonium group,' in which case, preferably, the surface of the 
solid support carries negative charges which repel the negatively 
charged nucleic acid backbone and thus facilitate 

the desorption required for analysis by a mass spectrometer. Desorption 

can occur either by the heat. 
DETD . . . can serve this purpose and that the donor functionality can be 

either on the solid support or coupled to the nucleic 

acid molecule to be detected or vice versa. 
DETD As noted, at least three version of immobilization are contemplated 

herein: 1) the target nucleic acid is amplified or 

obtained (the target sequence or surrounding DNA sequence must be known 
to make primers to amplify or isolated) ; 2) the primer nucleic 
acid is immobilized to the solid support and the target 
nucleic acid is hybridized thereto to form a promoter 
sequence; or 3) a double stranded nucleic acid 

encoding a promoter (amplified or isolated) is immobilized through 
linkage to one predetermined strand, and in vitro transcription is 
initiated. 

DETD In the embodiments where the primer nucleic acid is 

immobilized on the solid support and the target nucleic 
acid is hybridized thereto, the inclusion of the cleavable 
linker allows the primer DNA to be immobilized at the 5 ' -end so. 

DETD . . . those of skill in the art for immobilizing nucleic acids to 
solid supports may be used herein to link the nucleic 
acid to a solid support. The preferred linkers herein are the 
selectively cleavable linkers, particularly those exemplified herein. 
Other linkers include, . 

DETD . . . cross linking reagents that produce photocleavable linkages) , 
thereby releasing the targeted agent upon exposure to light. In 
preferred embodiments, the nucleic acid is 

immobilized using the photocleavable linker moiety that is cleaved 
during mass spectrometry. 
DETD A variety of chemically cleavable linkers may be used to introduce a 
cleavable bond between the immobilized nucleic acid 
and the solid support. Acid-labile linkers are presently preferred 
chemically cleavable linkers for mass spectrometry, especially MALDI-TOF 
MS, because the acid labile bond is cleaved during conditioning of the 
nucleic acid upon addition of the 3-HPA matrix 

solution. The acid labile bond can be introduced as a separate linker 
group, e.g., the acid labile trityl groups or may be incorporated in a 
synthetic nucleic acid linker by introducing one or 
more silyl internucleoside bridges using diisopropylsilyl , thereby 
forming diisopropylsilyl-linked oligonucleotide analogs. The 
diisopropylsilyl bridge replaces. 

DETD . . . sugar moiety of a nucleotide at positions other than the 3 ' and 
5' position is possible through conventional methods. Also, 
nucleic acid bases can be modified, e.g., as described 
in F. Eckstein, ed. , "Oligonucleotides and Analogues: A Practical 
Approach, " IRL Press (1991) 

DETD In preferred embodiments, modification of a nucleic 

acid, e.g., as described above, does not substantially impair 
the ability of the nucleic acid or nucleic sequence 

to hybridize to its complement. Thus, any modification should preferably 
avoid substantially modifying the functionalities of the nucleic 
acid which are responsible for Watson-Crick base pairing. The 
nucleic acid can be modified such that a non-terminal 
thiol group is present, and the nucleic acid, when 



immobilized to the support, is capable of self -complementary base 
pairing to form a "hairpin" structure having a duplex region. 

DETD . . . rate enzyme turnover. For example, the addition of 4-thio UTP, 
5-bromo UTP, 5-iodo CTP alter the hydrogen bonding of the 
nucleic acid facilitating, at least with some RNA 
polymerases, transcriptional termination and transcript release. 

DETD In preferred embodiments, nucleic acid 

promoter-containing probes are immobilized at high densities to the 
surface of a solid support in an array format. Particularly suitable. 

DETD In one preferred embodiment, a double stranded nucleic 

acid sequence encoding a promoter sequence is isolated from a 
natural source e.g., bacteria, viruses, bacteriophages, plants or 
eukaryotic organisms) or. . . Laboratory Press, New York) . This 
single stranded region is designed such that it is complementary to a 
region of the nucleic acid to be sequenced or to an 
sequence shared between the two nucleic acid 
molecules (e.g., restriction endonuclease site). 

DETD The nucleic acid to be sequenced containing at least 

a partially single stranded 3 ' -end is hybridized according to the 

conditions described herein and known to those of skill in the art to 

the complementary sequences of the promoter-containing DNA. The 

nucleic acid to be sequenced may be single stranded or 

double stranded. The hybridization of the two nucleic 

acid molecules introduces one or more "nick" in the hybrid at 

the j unction (s) of the adjacent nucleic acid 

molecules. Nicks in the coding or non-coding strand, preferably the 
coding strand, can be ligated by the addition of an appropriate 
nucleic acid ligase prior to initiating transcription. 

Methods for ligating nucleic acids are well known to those of skill in 
the art . 

DETD * • ; method' of sequencing may be used for diagnostic applications to 
determined the presence of genetic alterations in a known target 
nucleic acid. For example, a region of the target 
nucleic acid can be amplified using standard methods, 

such as PCR or other amplification methods known to those of skill in 
the. . . art (e.g., see Sambrook et al . , (1989) Molecular Cloning, 
2nd ed., Cold Spring Harbor Laboratory Press, New York). The amplified 
nucleic acid can be denaturated and the strand to be 

sequenced, i.e., the noncoding strand, is isolated or may be used as. 

DETD In preferred embodiments, the nucleic acid 

promoter-containing probe is covalently immobilized on a silica support 
by functionalization of the support with an amino functionality (e.g., 
b y- • • (Pierce, Rockford, 111.). Other homo- and hetero-bif unctional 
reagents which can be employed are available commercially, e.g., from 
Pierce. Finally, a nucleic acid functionalized with 

a thiol group (e.g., at the S'-terminus) is covalently bound to the 
derivatived silica support by reaction of the thiol functionality of the 
nucleic acid molecule with the iodoacetyl 
functionality of the support. 
DETD In certain embodiments, the nucleic acid can be 

reacted with the cross-linking reagent to form a cross-linker/ 
nucleic acid conjugate, which is then reacted with a 
functionalized support to provide an immobilized nucleic 
acid. Alternatively, the cross-linker can be combined with the 
nucleic acid and a functionalized solid support in one 

pot to provide substantially simultaneous reaction of the cross-linking 
reagent with the nucleic acid and the solid support. 
In this embodiment, it will generally be necessary to use a 
heterobifunctional cross-linker, i.e., a cross-linker with two different 
reactive functionalities capable of selective reaction with each of the 
nucleic acid and the functionalized solid support. 
DETD • • • insoluble support can be selectively cleaved (e.g., by 



photolithography) to provide portions of a surface activated for 
immobilization of a nucleic acid. For example, a 
silicon surface, modified by treatment with 3 -mercaptopropyl- 
triethoxysilane to provide thiol groups, can be blocked with a 
photocleavable . . . and be selectively deblocked by irradiation of 
selected areas of the surface, e.g., by use of a photolithography mask. 
A nucleic acid promoter-containing probe modified to 
contain a thiol -reactive group can then be attached directly to the 
support, or, alternatively, a thiol-reactive cross-linking reagent can 
be reacted with the thiol -modified support, followed by (or 
substantially simultaneously with) reaction with a nucleic 
acid to provide immobilized nucleic acids. A nucleic 
acid base or sequence, once immobilized on a support according 
to the methods described herein, can be further modified according to 
known methods, for example, the nucleic acid 
sequence can be lengthened by performing solid-phase nucleic 
acid synthesis according to conventional techniques, including 
combinatorial technique . 
DETD . . . may be identified using mass spectrometric methods. In 

practicing the methods, a single stranded region of the 3 ' -end of the 
nucleic acid to be sequenced is hybridized to a 
complementary sequence at the 3 ' -end of the coding strand a 
promoter-containing nucleic acid probe. In preferred 
embodiments, the promoter-containing nucleic acid is 

covalently coupled via the 5 1 -end of the noncoding strand or 3 1 -end of 
the coding strand to a solid support. 
DETD In certain embodiments, nicks in one or more strand resulting from the 
hybridization of the nucleic acid to be sequenced 
may be ligated by the addition of an appropriate nucleic 
acid ligase prior to initiating transcription (i.e., adding a 
DNA or RNA ligase) . 

DETD . . . hybridized, radiolabeled probes were employed. In cases where a 
5 1 -disulf ide-containing oligodeoxynucleotide was to be immobilized, the 
3' -terminus was radiolabeled using terminal 
transferase enzyme and a radiolabeled dideoxynucleoside 
triphosphate; in a standard reaction, 15 pmol (0.6 uM) of the 
5 ' -disulf ide-containing oligodeoxynucleotide was incubated. 

DETD Use of High Density Nucleic Acid Immobilization to 
Generate Nucleic Acid Arrays 

DETD All primers were synthesized on a commercially available DNA Synthesizer 
using conventional phosphoroamidite chemistry (Sinha et al . (1984) 
Nucleic Acid Res. 12:4539). In vitro RNA transcription 

was performed on a synthetic 55 nucleotide double stranded DNA template. 
The template was . 

DETD All primers were synthesized on a commercially available DNA Synthesizer 
using conventional phosphoroamidite chemistry (Sinha et al . (1984) 
Nucleic Acid Res. 12:4539) . In vitro RNA transcription 

was performed on a synthetic 2 76 nucleotide double stranded DNA template 
(SEQ ID No. . 
CLM What is claimed is: 

1. A method for determining the sequence of a target nucleic 

acid molecule, comprising: a) immobilizing a nucleic 

acid promoter-containing probe on a solid support, wherein: the 

nucleic acid promoter-containing probe comprises at 

least 5 nucleotides at the 3' -end of the coding strand that is 

complementary to a single stranded region at the 3 ' -end of the target 

nucleic acid, and a double -stranded portion that 

comprises the promoter, which is oriented to permit transcription of a 

hybridized target nucleic acid molecule; b) 

hybridizing the target nucleic acid to the 

single-stranded portion of the immobilized nucleic 

acid probe; c) transcribing the target nucleic 

acid with an RNA polymerase to produce a plurality of 

base-specifically terminated RNA transcripts, wherein the RNA polymerase 



recognizes the promoter; . . . the molecular weight value of each 
base-specifically terminated RNA transcript by mass spectrometry; and e) 
determining the sequence of the nucleic acid by 

aligning the base-specifically terminated RNA transcripts according to 
molecular weight. 

3. The method of claim 1, wherein the immobilized nucleic 
acid promoter-containing probe is produced by immobilizing a 
single-stranded molecule that comprises a promoter or the complement of 
a promoter and. 

to form a phosphodiester bond between the 3 » hydroxyl group and the 
5' phosphate group of adjacent strands of the nucleic 
acid probe and the target nucleic acid. 

10. The method of claim 1, wherein prior to immobilization of the 
nucleic acid, the surface of the support is 

derivatized by reacting the surface with an aminosilane to produce 
primary amines on the. . . 

14. The method of claim 12 , wherein the immobilization of the 
nucleic acid probe to a solid support is effected by 
reacting the .thiol-reactive solid support with a nucleic 
acid probe having a free 5 ' - or 3 ' -thiol group, whereby a 
covalent bond between the thiol group and the thiol-reactive solid. 

15. The method of claim l, wherein the nucleic acid 

probe is covalently bound to a surface the solid support at a density of 
at least 2 0 fmol/mm. sup . 2 . 

18. The method of claim 1, wherein the surface comprises a plurality of 

wells comprising the immobilized nucleic acid 

molecule. 

25. The method of claim 1, wherein the hybridization of the 
nucleic acid to be sequence to the solid support 

results in the formation of a nick in the coding strand corresponding 
to. 

28. A method of identifying transcriptional terminator sequences or 
attenuator sequences in a target nucleic acid 
molecule, comprising: a) immobilizing a nucleic acid 
promoter-containing probe on a solid support, wherein the 
nucleic acid promoter- containing probe comprises at 
least 5 nucleotides at the 3 ■ -end of the coding strand that is 
complementary to a single stranded region at the 3 1 -end of the target 
nucleic acid, and a double-stranded portion that 

comprises the promoter, which is oriented to permit transcription of a 
hybridized target nucleic acid molecule; b) 
hybridizing the target nucleic acid molecule to the 
immobilized nucleic acid probe; c) transcribing the 
target nucleic acid with an RNA polymerase to 

produce a sequence -terminated RNA transcript, wherein the RNA polymerase 
recognizes the promoter; and d) determining. . . observed mass of the 
RNA is indicative of the presence of a the terminator sequence or 
attenuator in the target nucleic acid molecule. 
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SUMM . . . thus interfere with the replication or transcription of 

selected target genes. As is known, except for certain RNA viruses and 
nucleic acid-free viroids, DNA is the repository for 
all genetic information, including regulatory control sequences and 
non-expressed genes, such as dormant proviral. 

SUMM ... to inhibit restriction and/or transcription of the target 

double stranded DNA. Based on the known stabilities of the two target 
nucleic acid species (i.e., DNA and RNA), anti-gene 

interference with DNA functioning has longer lasting effects than the 
corresponding antisense inhibition of. 
SUMM . . . chain of nucleotides which are linked to one another by 
phosphate ester linkages. Each nucleotide typically comprises a 
heterocyclic base (nucleic acid base) , a sugar 

moiety attached to the heterocyclic base, and a phosphate moiety which 
esterifies a hydroxyl function of the. 
SUMM . . . sugars or sugar analogs and of the respective "nucleosides" 
wherein such sugars or analogs are attached to a heterocyclic base ( 
nucleic acid base) per se is known, and need not be 

described here, except to the extent such preparation is provided here. 

SUMM The heterocyclic bases, or nucleic acid bases which 

are incorporated in the modified ODNs of the present invention may be 
the naturally occurring principal purine and. 

SUMM Other examples of nucleotides where the crosslinking 
agent is attached to a heterocyclic base, are 

2 ' -deoxy-4-aminopyrazolo [3 , 4 -d] pyrimidine derivatives. The general 
structure of these derivatives is shown below in Formula. 
SUMM . . . include ligands which bind to antibodies labeled with 

f luorophores, chemiluminescent agents, and enzymes. Alternatively, 
probes can be conjugated directly with labels such as 

f luorophores , chemiluminescent agents, enzymes and enzyme substrates. 
Alternatively, the same components may be indirectly bonded through a 
ligand-antiligand . 

SUMM . . . for example, by using DNA synthesizers, by nick-translation, by 



tailing of radioactive bases in the 3 ' end of probes with 
terminal transferase, by copying M13 plasmids having 
specific inserts with the Klenow fragment of 
DNA polymerase in the presence of radioactive dNTP's, 
or by transcribing RNA from templates using RNA polymerase in the 
presence of radioactive. 
SUMM The particular hybridization technique is not essential to the 

invention. Hybridization techniques are generally described in " 
Nucleic Acid Hybridization, A Practical Approach", 

Hames and Higgins, Eds., IRL Press, 1985; Gall and Pardue, Proc . Natl. 

Acad. Sci., U.S.A., 63:378-383. 
SUMM . . . present in the hybridization solution may vary widely. 

Generally, substantial excess of probe over the stoichiometric amount of 

the target nucleic acid will be employed to enhance 

the rate of binding of the probe to the target DNA or RNA. 
SUMM This first aspect of the invention is also directed to a method or 

identifying target single stranded nucleic acid 

sequences, which method comprises utilizing an oligonucleotide probe 
including at least one ODN having a cross-linking agent and a label. 

SUMM . . . one labeled ODN having a cross-linker covalently attached, 

wherein the ODN comprises a sequence complementary to that of the target 
nucleic acid sequence ; 

SUMM An assay for identifying target single stranded nucleic 
acid sequences utilizing a labeled oligonucleotide probe 
including the covalently attached cross-linking agent and comprising the 
above method is contemplated for. . . probe reagent (ODN) having a 
sequence complementary to that of the target nucleic acids; a 
denaturation reagent for converting double -stranded nucleic 
acid to a single-stranded nucleic acid; and 

a hybridization reaction mixture. The kit can also include a 
signal-generating system, such as an enzyme for example, and. 
DETD DNA polymerase 1 (U.S. Biochemicals ) - -8 U/mL 

DETD . . . mL) , nucleotide mix A (6 mL) , Bio-12 -dAPPTP (2 mL) , and H.sub.2 

0 (20 mL) was added DNase (1 mL) and DNA polymerase 

1 (2.4 mL) . The reaction mixture was incubated at 16° C. for 1 
hour. The procedure was repeated using Bio-lldUTP. 

DETD Nucleic acid was isolated by ethanol precipitation 

and hybridized to pHPV-16 slotted onto nitrocellulose. The hybridized 
biotinylated probe was visualized by a. 

DETD The reaction of crosslinking a DNA probe to a target nucleic 
acid sequence contained 1 ug of haloacylamidoalkyl probe and 
10 ng of .sup. 32 P-labeled cordycepin-tailed target in 200 uL of 
0.1M. 

DETD . . . with the HPV system of Example 2 5 where U is 

5- (3-iodoacetamidoprop-l-yl) -2 1 -deoxyuridine . The target was .sup. 32 
P-labeled by cordycepin tailing with terminal 

transferase (Maniatis et al . , "Molecular Cloning- -A Laboratory 

Manual", Cold Spring Harbor Laboratory, 1982, p. 239) and incubated with 

excess probe in. 

DETD . . . the sugar or any heterocyclic base within the ODN. A 

cross-linking agent which has two cross-linking functionalities, such as 
a crosslinking agent having the formula 

--N-- [ (CH.sub.2) .sub. 2 --L].sub.2 (a bifunctional N-mustard) is capable 
of two alkylations, and is therefore considered as two cross-linking. 

DETD (2) INFORMATION FOR SEQ ID NO : 1 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
CTGGATGTUC CTTC 14 



DETD (2) INFORMATION FOR SEQ ID NO : 2 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 
AGACAGCACA GAATTCGAAG GAACATCCAG 3 0 
DETD (2) INFORMATION FOR SEQ ID NO : 3 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 
ACCGTCCTTG ACACGATGGA CTCC 24 
DETD (2) INFORMATION FOR SEQ ID NO : 4 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 
CTCCAUCGTG TCAAG 15 

DETD (2) INFORMATION FOR SEQ ID NO : 5 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 
NAGAGGAGAA AGGAGAGAGN 2 0 

DETD (2) INFORMATION FOR SEQ ID NO : 6 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 
ATATAAGGAG AGAGGAAAGA GGAGACAAA 2 9 
DETD (2) INFORMATION FOR SEQ ID NO : 7 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 
TTGTGGTGGT YGTGTYGTGG TGGG 24 
DETD (2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
GGGAGGAGCA GAGGAGGAGG AGAA 24 
DETD (2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
TTTTCTTTTY GGGGGTN 17 



DETD (2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
TTTTTAAAAG AAAAGGGGGG ACTGG 2 5 
DETD (2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
NCTTTCCTCT CTTTTCCCCN 2 0 

DETD (2) INFORMATION FOR SEQ ID NO : 12 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 
AAATACTGGG AGAAAGGAGA GAAAAGGGGA CCCAACGTAT 4 0 
DETD (2) INFORMATION FOR SEQ ID NO : 13 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 72 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 

CGGGAAGCTA GAGTAAGTAG TTCGCCAGTT AATAGTTTGC GCAACGTTGT TGCCATTGCT 
ACAGGCATCG TGGTGTCACG CTCGTCGTTT GGTATGGCTT CATTCAGCTC CGGTTCCCAA. 
DETD (2) INFORMATION FOR SEQ ID NO: 14: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

ATGTTGTGCA AAAAAGCGGT TAGCTCCTTC GGTCCUCCGA TCGTTGTCAG 50 
DETD (2) INFORMATION FOR SEQ ID NO: 15: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 15 : 

ATGTTGTGCA AAAAAGCGGT TAGCTCCTTC GGTCCTCCGA TCGTTGTCAG 50 
DETD (2) INFORMATION FOR SEQ ID NO : 16 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
AAAAAGCGGT TAGCTCCTTC GGTCCUCCGA TCGTTGTCAG 4 0 
DETD (2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



TAGCTCCTTC GGTCCUCCGA TCGTTGTCAG 30 
DETD" (2) INFORMATION FOR SEQ ID NO: 18: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
GGTCCUCCGA TCGTTGTCAG 2 0 

DETD (2) INFORMATION FOR SEQ ID NO: 19: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

CCACCACATC GCCGCATAAC CGATCCCTTC GGTCCUCCGA TCGTTGTCAG 50 
DETD (2) INFORMATION FOR SEQ ID NO: 20: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

ATGTTGTGCA AAAAAGCGGT TAGCTTTCCT AACTTUTTAC CTACCACTGA 50 
DETD (2) INFORMATION FOR SEQ ID NO: 21: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

CCACCACATC GCCGCATAAC TAGCTCCTTC GGTCCUCCGA TCGTTGTCAG 50 
DETD (2) INFORMATION FOR SEQ ID NO: 22: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

ATGTTGTGCA AAAAAGCGGT TAGCTCCTTC AACTTUTTAC CTACCACTGA 50 
DETD (2) INFORMATION FOR SEQ ID NO: 23: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

AAAAGCGGTT AGCTCCTTCG GTCCUCCGAT CGTTGTCAGA AGTAAGTTG 4 9 
DETD (2) INFORMATION FOR SEQ ID NO: 24: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 

AAAAGCGGTT AGCTCCTTCG ACCCUCCACT CGTTGTCAGA AGTAAGTTG 4 9 
DETD (2) INFORMATION FOR SEQ ID NO: 25: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 



AAAAGCGGTT AGCTCCTTCG ACTCUCTACT CGTTGTCAGA AGTAAGTTG 4 9 
DETD (2) INFORMATION FOR SEQ ID NO: 26: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

AAAAGCGGTT AGCTCCTTCG ACTTUTTACT CGTTGTCAGA AGTAAGTTG 4 9 
DETD (2) INFORMATION FOR SEQ ID NO: 27: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

GGTTATTTTT GAAGATACGA ATTTCUCCAG AGACACAGCA GGATTTGTCA 5 0 
DETD (2) INFORMATION FOR SEQ ID NO: 28: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
GAAGATACGA ATTTCUCCAG AGACACAGCA 3 0 
DETD (2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

CCCCCATGTT GTGCAAAAAA GCGGTTAGCT CCTTCGGTCC TCCGATCGTT GTCAGAAGTA 
AGTTGGCCGC A 71 
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